Unit 2: Data Quality Transforms 


Multiple formats for same data elements 
Different meanings for the same code value 
Multiple code values with the same meaning 
Field overuse: field used for unintended purpose 
Data in filler 

e Migration (ETL) errors 
Normalization inconsistencies 
Duplicate or lost data 


e Data structure problems 


Before you can implement an effective data quality project, you must understand the data 
quality framework. The data quality framework is a continuous cycle of activities completed in 
order, which include: measuring, analyzing, parsing, standardizing, cleansing, enhancing, 
matching, consolidating, and continuous monitoring. 
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Data Cleansing 
Figure 7: Data Quality - Assessment 


Start the process by using a data profiling application to quantify the number and types of 
defects in your data. 





2. Analyzing 
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